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A PARTIALLY DISTRIBUTED CONTROL MECHANISM FOR SCANOUT 
INCORPORATING FLEXIBLE DEBUG TRIGGERING 

This application is a continuation-in-part of U.S. 
patent application serial no. 09/677,392 filed September 29, 
5 2000. 

BACKGROUND OF THE INVENTION 



1 . Field of the Invention 

The invention relates to error detection/correction and 
10 fault detection/recovery. More particularly, the invention 



relates to digital logic testing Jthrough a reduction in the 



ng)t 



r3 „„ — - . 

■ % Q number of global test control lines. 

fS 2 . Background Inf ormation 

t s{ 15 An integrated circuit or "chip" is a microelectronic 

J\ semiconductor device having many interconnected transistors 

;~ and other components. Chips may be fabricated on a small 

fy 

rectangle cut from a silicon wafer. The small size of these 

ij 

12 circuits allows high speed, low power dissipation, and 

20 reduced manufacturing cost compared with board- level 
integration. 

The first integrated circuits contained only a few 
transistors. Small Scale Integration (SSI) brought circuits 
containing transistors numbered in the tens. Later, Medium 
2 5 Scale Integration (MSI) contained hundreds of transistors and 
Large Scale Integration (LSI) contained thousands of 
transistors. At present Very Large Scale Integration (VLSI) 
circuit chips are composed of hundreds of thousands of logic 
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elements or memory cells. Digital VLSI integrated circuits 
may contain anything from one to millions of logic gates - 
inverters, gates, flip-flops, and multiplexors on a few 
square millimeters. 
5 Part of producing a scaleable VLSI chip includes testing 

and debugging the chip. Debugging is an attempt to determine 
the cause of any malfunction symptoms detected by testing. 
Circuitry may be built into an integrated circuit to assist 
in the test, maintenance, and support of an assembled 

10 circuit. Hardware features, known as design for test (DFT) 

features or resources, may be incorporated into a chip to aid 
in testing and debugging. 

Determining the cause of a malfunction or other problem 
may be achieved by using a testing machine to send a 

15 simulated signal from a debug pin residing on the perimeter 
of the chip to a logic element within the chip so as to 
trigger a response bit (0 or 1) from that logic element. On 
a clock signal, an instruction may cause a snapshot or "scan" 
to be taken of this triggered response bit by a DFT feature. 

2 0 On the next clock signal, and as part of that same 

instruction, the scan information bit may be shifted one bit 

ou+fu+ on 4Kc 

Qs^^ "out" towards a serial output the chip to a second perimeter 
pin so that the scan bit may be compared to an expected 
response. If this triggered response or "scanout" varies 

25 from the expected response, then that particular logic 
element may be a cause of the noted problem. 

•-■-To-TtdocixiLLtQly debug a chip, it may be necessary to view 
a sampled state of hundreds of chip-internal signals within a 
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space of minutes. Conventionally, - a f - d br^b debugging 

technique may be used to isolate a probable bug location from 

a million transistors to a group of a few hundred 

transistors. Then, a second debugging technique may be used 

5 to rapidly find the exact transistor failure point from the 

grouped few hundred transistors. Chips may have redundant 

transistors in them such that, once a transistor failure 

point is located, the transistor may be turned off as part of 

chip production while a redundant transistor may then 

10 similarly be turned on. 

fey ^a^pk , -H> • $o\Q+<- 

- i P o isoiba ^e. a probable bug location from a million 
rg transistors to a group of a few hundred transistors, an 

integrated circuit chip may be designed to include internal 
read-only test points. These test points (or scanout 
15 "cells") generally are scattered throughout the integrated 
circuit chip. When chained together to form a distributed 
shift register, scanout cells produce parallel data that 
provides observability of selected nodes during functional 
[3 testing during normal operation of the chip. 

20 U.S. patent 5,253,255 teaches a centralized control 

mechanism that employs two global test control lines for each 
vr^ design for test (DFT) feature. -Horg,^ sequentially activating 
the snapshot and the shift with signals requires two global 
speed critical signals that require close timing tolerances 
25 between the two signals. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is illustrated by way of example and not 
by way of limitation in the figures of the accompanying 
drawings in which like references indicate similar elements. 
It should be noted that references to "an" or "one" 
embodiment in this disclosure are not necessarily to the same 

embodiment, and such references mean at least one. 

Of an eryibad'me/rf of4t\* t r\\kn-hdr\ 

Figure 1 illustrates platform 100 of the i-mrention 

having chip 102 disposed on support structure 104; 
Figure 2 illustrates chip 200; 

Figure 3 illustrates chip 300 as a conventional 
modification of chip 200 of Figure 2; and . *r:4JLz, , ^ 

Figure 4 illustrates chip 400 of the invention . 

Figure 5 illustrates a global and local view of scanout 
control in an embodiment of the invention. 

Figure 6 illustrates logic equations and details of 
control line signals of an embodiment of the invention. 

Figure 7 illustrates scanout clock waveforms for 
shifting a scanout chain. 

Figure 8 illustrates scanout clock waveforms for a 
snapshot on an external trigger. 
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DETAILED DESCRIPTION OF THE INVENTION 

Figure 1 illustrates platform 100 of the invention 
having chip 102 disposed on support structure 104. Platform 
5 100 may be any structure having electronic components, such 
as chip 102. For example, platform 100 may be a computer or 
computer system. Moreover, platform 100 may be a printed 
circuit board (PCB) . By way of background, a computer or 
other electronic system might be built from several PCBs, 
10 such as processor, memory/ graphics controller, and disk 

u onlrollc^ These boards might all plug into a motherboard 
or backplane or be connected by a ribbon cable. 
yi Chip 102 may be referred to as a processor or integrated 

circuit and may be thought of as a microelectronic 
2 15 semiconductor device having many interconnected transistors 
H and other components. Platform 100 may also include memory 

til controller 106, Peripheral Component Interconnect (PCI) 

CO bridge 108, and chip 110, each communicatively coupled 

□ through processor bus 112. 

2 0 Memory controller 106 may be coupled to memory chips 

114. A controller may be thought of as that part of platform 
100 that allows platform 100 to use certain kinds of devices. 
Thus, memory controller 106 may be thought of as that part of 
platform 100 that allows platform 100 to use memory chips 
25 114. Memory chips 114 may be any device that can hold data 
in machine-readable format. 

PCI is a standard to connect external or peripheral 
devices to a personal computer. The PCI standard entitled 
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PCI to PCI Bridge Architecture Specification (Rev. 1.1, PCI 
Special Interest Group, Portland, Oregon, 1998) may be 
implemented in a mezzanine or a bridge, such as PCI bridge 
108. Here, PCI bridge 108 may include buffers to decouple 
5 chip 102 from relatively slow peripherals and to allow the 

peripherals to operate asynchronously. PCI bridge 108 may be 
coupled to external devices through controller 122 over PCI 
bus 124. These external devices may be input devices such as 
keyboard 116, mouse 118, and modem 120. 

10 Chip 110 may be an integrated circuit that is similar to 

chip 102.- Chip 102 may include bus interface unit (BIU) 126 
and test controller 128. BIU 126 may operate as an 
input /output port to js emmus - i c a t ed signals between processor 
bus 112 and chip 102. Test controller 128 may be any device 

15 that asserts test instructions to design for test (DFT) 
features, where DFT may be design and hardware features 
incorporated into chip 102 to aid in manufacturing and 
debugging . 

Also coupled to platform 100 may be tester 130. Tester 
20 13 0 may be a VLSI tester that communicates test instructions 
to test controller 128 over bus 132. Bus 132 may be thought 
of as a test and maintenance bus used as part of the 
interface design to access chip 102 . Institute of Electrical 
and Electronics Engineers (IEEE) standard 1149.1, entitled 
25 Test Access Port and Boundary-Scan Architecture, is an 

internationally recognized design standard specifying product 
^(design and test protocols. Particularly, IEEE 1149-l^jfiay 
^ap£i> ify ■ the intorfaoc desi g n Lu auucofe a chip for testing 



042390 . P9572X Express MaiJ^Jo . EM560646872US 



0 



purposes. In one embodiment, bus 132 meets the five-pin 
requirement of IEEE standard 1149.1. Moreover, under IEEE 
1149.1, a test access port (TAP) may be an example of test 
controller 128. 

5 Figure 2 illustrates chip 200. Chip 200 may include 

logic units 202, 204, 206, 208, 210, and 212, each of which 
may deal with basic operations of a computer system. For 
example, logic unit 2 02 may be a floating point unit (FPU) , 
logic unit 204 may be an arithmetic logic unit (ALU) , and 

10 logic 212 may be a memory logic. Each logic unit may be 
coupled to bus interface unit 214 through logic unit 
controllers 216, 218, 220, 222, 224, and 226 as shown. These 
controllers local to the logic units of chip 200 may be 
positioned very close to their respective logic unit. 

15 Moreover ,. bus interface unit (BIU) 214 may be similar to BIU 
12 6 of Figure 1. 

Clock generator 227 may be a cor e -clack that: is 
connected between each logic unit controller and BIU 214. 
-e iuck clock generator 227 may produce a signal that may be 

2 0 distributed over clock lines within chip 2 00 in a tree like 
structure to the logic units 202, 204, 206, 208, 210, and 
212. Each clock signal of a given clock pulse may need to 
reach its destination within a timing window for events 
within chip 200 to be synchronized. Timing skew may be 

25 thougHP, of t+io variation between the arrival moment of a 

first signal as compared to the arrival moment of one or more 
other signals at the same or different logic unit. 
Controllers may be placed within chip 2 00 to control this 
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timing skew. These deskew (DS) e onta?olle r* s - may include a 

whole host of logic elements that aid in receiving, 
processing, and retransmitting signals such as clock signals 
-aa* e iiisLiucl* -Q&s. to the logic units so as to synchronize the 
arrival of all instructions to their associated logic unit. 
In other words, DS controllers may aid in minimizing any 
timing skew between the signals. Logic unit controllers 216, 
218, 220, 222, 224, and 226 may be DS controllers. As will 
be discussed below, - th e -invention takes advantage of logic 
unit controllers 216-226 to receive, process, and retransmit 
testing signals to reduce the number of global test control 
lines . 

Part of producing a working chip includes testing and 
debugging that chip. Debugging is an attempt to determine 
the cause of any malfunction symptoms detected by testing. 
Circuitry may be built into an integrated circuit to assist 
in the test, maintenance, and support of an assembled 
circuit. Hardware features, known as design for test (DFT) 
features, may be incorporated into a chip to aid in testing 
and debugging. 

Each logic unit within chip 200 may be coupled to cells 
chained together to form a register. For example, logic 208 
may be coupled to register 228 having cells 230, 232, 234, 
and 236. Logic 212 may be coupled to register 238 having 
cells 240, 242, 244, 246, 248, and 250. Cells 230-236 and 
240-250 may be examples of DFT features. 

^-MTfo i. p ?&ft T ^^y-^^i f -i nn ^i ]y i^ni nHn. test controller 2 52 
and tester 254. Test controller 252 may be viewed as an 
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integrated test controller when residing within chip 200. 

Included within test controller 252 may be an instruction 

register (IR) and a test access port finite state machine 

(TAP FSM) as described in IEEE 1149.1. Test controller 252 
5 and tester 254 may be similar to test controller 128 and 

tester 130 of Figure 1, respectively. 

Figure 3 illustrates chip 300. Chip 300 may be a 

conventional modification of chip 200 of Figure 2. 

Conventionally, one global test control line is required to 
10 run between test controller 252 and a DFT feature for each 

type of instruction signal. For example, to transmit a load 
•ft signal and a test signal from integrated test controller 

m (ITC) 252 and register 238 of Figure 3, chip 300 may employ 

fQ global test control lines 3 02 and 3 04, respectively. In a 

^ 15 similar way, lines 306 and 308 may run between ITC 252 and 
f. register 22 8. In other words, for a scanout process, each 

lH DFT feature conventionally may employ two global test control 

W lines. For a different type of testing process, each design 

C3 for test Feature conventionally may employ three, four, five 

2 0 or more global test control lines. 



l o riall y *! £nGLv&& fifty* 



to^one hundred global 



test control lines to transmit information to anywhere from 
5,200 DFT features to 49,000 DFT features. These global test 
control lines may limit chip designers in the placement and 
25 arrangement of the control /lines and logical units on chip 
300 by taking up valuable /space on chip 300. More 
restrictively , the timing of the signals distributed by each 
of these global test control lines is critical. As noted 
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below, the invention may reduce the number of global test 
control lines to nineteen (for example, one bus having 
nineteen lines) to transmit that s same information to the 
design for test features of a chip. 

JiguroHl illustrates chip 400 of the invention. As can 
be readily seen, chip 400 may be a modification of chip 200 

gf Figure 2. Included with chip 400 may be internal test 
uS ^X(5\oba\ C6rAto\ woe) 

bus disposed between ITC 252 and logic unit controller 
226 and internal test bus 404 disposed between ITC 252 and DS 
controller 222. Chip 400 may also include additional 
internal test buses (not shown) disposed between ITC 252 and 
each logic unit controller within chip 400. 

mavf Pass 

Each internal test bus (ITB) of chip 400 may ■ bo adapted 
^fe o - paoc test instruction signals as a test instruction packet 
from ITC 252 to a logic unit controller. Although the bits 
of these signals may travel over different lines, these 
different lines may be within a single internal test bus 
having a single routing path between ITC 252 and a logic unit 
controller. Moreover, these test instruction signals may 
travel according to a clock that is internal to tester 2 54 
rather than the core clock that is internal to chip 400, here 
clock 227. For these and other reasons, timing is not 
critical with respect to signals routed within an internal 
test bus in one embodiment of the invention. 

To transmit a test information packet in parallel, each 
internal test bus of chip 400 may include n number of lines 
such that 

n = a + log 2 i (500) 
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where 

n - number of lines, 

a = number of ancillary transmission bits, and 
log 2 i ( = number of instruction bits. i. 

Ancillary transmission bits may be those supporting bits 
that may^needed to accompany the instruction bits. For a 
given distributed test control architecture compliant with 
IEEE 1149.1, the number of ancillary transmission bits may be 
a constant. The number of instruction bits (log 2 i) may be 
thought of as a bit stream of zeros and ones. Moreover, the 
number of instruction bits (log 2 i) may represent the number of 
unique testing tasks that are desired to be performed within 
the collective of logic unit controllers (e.g., 222 and 226) 
of chip 400. For example, where the number of instruction 
bits equals eight (8 = log 2 256) , the test instruction packet 
may include up to . two hundred and fifty six (256) unique 

testing task signals. The instruction bits may be disposed 

Coy>n Pliant :nS+ruc*<jn tts>^Hr 

within an IEEE 1149.1 instruction - rogicto - r . 

In one embodiment, the information transmitted over 
internal test bus 402 may include a shift signal and a load 
signal. In another embodiment, the information transmitted 
over internal test bus 402 may i nclude a- .one bit clock Ci ign a^ 
and the following five components representing eighteen bits: 

(i) instruction register contents (8 bits) ; 

(ii) partially encoded, relevant states of a test access 
port finite state machine (TAP FSM) (4 bits) ; 

(iii) security-bit (1 bit) ; 

(iv) test data input (TDI) (1 bit); and 
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(v) counter value from a Wave Shaper (4 bits) . 
Component (i) may be thought of the number of 
instruction bits (log 2 i) and components (ii) , (iii) , (iv) , and 
(v) may be thought of as ancillary transmission bits (a) . 
Accordingly, from equation 500 above, 



These nineteen bits of information may travel as a test 
information packet. Where the toot inctruct - i r on - packet is 
encoded, the logic unit controller may include components 
that decode the test instruction packet. 

Not all the states of the test access port finite state 
machine (TAP FSM) need be transmitted over a test bus (402, 
404) of chip 400. This may be true where the state in 
question is not relevant to control DFT logic internal to 
chip 400. In one embodiment, the relevant states of TAP FSM 
under IEEE 1149.1 include the following six states: test- 
logic-reset (tlr) , run- test/idle (rti), capture-down register 
(capDR) , shift-DR (shiftDR) , update-DR (updtDR) , and dead (the 
remaining eleven states) . Accordingly , in one embodiment of 
the invention, a subset of all the states of a test access 
port finite state machine are encoded into three bits. In 
another embodiment, individual bits may be allocated for 
test-logic-reset (tlr) and run- test/idle (rti) and the 
residual (remainder) states (capDR, shiftDR, updtDR, and 
dead) are allocated among two bits. 

On receiving a test instruction packet, a logic unit 
controller may locally generate whatever testing signals are 
needed for the desired testing operation. For example, on 



19 = (1+4+1+1+4) + log 2 256 



(500) . 
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receiving a test instruction packet having a shift signal and 
a load signal, logic unit controller 22 6 may process the test 
instruction packet to locally generate a shift signal and a 
load signal. Logic unit controller 226 may then pass these 
5 two signals to register 238 over the relatively short 

distance of distributed test line 406 and distributed test 
line 4 08, respectively. 

Under the conventional method of Figure 3, a shift 
signal and a load signal each travel separately over its own 
10 global test control line between the ITC 2 52 and register 

238. The independent travel of these and other test signals 
over a relatively great distance requires tight or critical 

in control over the timing of these signals. In contrast, one 

riJ/T Senate ±0$Aku, a$ 

}f%\}\ * embodiment of the invention transmits these - signalo o r s an 

^2 15 instruction bundle or packet between the ITC 252 and logic 

B unit controller 22 6 so as to eliminate the number of global 

lH test control lines and the requirement for critical timing. 

CO One embodiment of the invention may be employed as a 

□ method to control at least one DFT feature, such as register 

20 238 of Figure 4. A test information packet may first be 

generated in a test controller of an integrated circuit. The 

test information packet may then be transmitted to at least 

one logic unit controller over a test bus coupled between the 

test controller and the at least one logic unit controller. 

25 The test information packet may then be processed within the 

at least one logic unit controller to generate at least one 

test control signal. The at least one test control signal 

may be transmitted to the at least one DFT feature coupled to 
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the logic unit controller. A logic unit coupled to the at 
least one DFT feature may be interacted with based on the at 
least one test control signal. 

In another embodiment, a single speed critical global 
control signal and two locally generated control signals are 
used for debug and testability. The two local control 
signals are generated by local controllers, and are not speed 
critical. Therefore, a partially-distributed control scheme 
is formed. This is due to having two signals that are local 
(distributed control) , and one signal that is global. As was 
presented above, conventionally two global signals are used 
for debug and testability. 

Since a snapshot and shift instruction are 
conventionally handled as part of a single instruction, the 
timing of the two global control signals conventionally is 
very time critical and a close timing tolerance is necessary. 
In contrast, in one embodiment the snapshot and shift 
instructions are partitioned into separate operations. 
Therefore, a snapshot can be performed and the results can be 
shifted out after a period of time. Also, trigger mechanisms 
are implemented that improve debugability on an integrated 
circuit . 

Figure 5 illustrates global view 500 and local view 505 
of scanout control. Also illustrated in Figure 5 is debug 
unit (DBG) 510, ITC 520, ITB 530, and DS controller 540. 
Conventionally, five (5) scanout signals are defined as 
f o 1 1 ows : SCANOUTRST , SC ANOUTLO AD , SC ANOUTLOADDBG , 
SCANOUTSHIFT and SCANOUTSIG. It should be mentioned that 
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SCANOUTLOAD instructions are used for a snapshot. 

SCANOUTRST resets the scanout chain by setting both load 
and shift controls to 0. SCANOUTLOADDBG switches scanout 
cells to load mode and then waits for trigger from the DBG 
5 unit 510 illustrated in Figure 5. SCANOUTLOAD switches 
scanout cells to load mode and a snapshot occurs in Run- 
test/Idle on a low- to-high transition of TDI . SCANOUTSHIFT 
switches scanout cells to shift mode and shifts captured data 
in shift-DR (data register) operation sequence. SCANOUTSIG 
10 sets both load and shift controls of scanout cells to 1 and 
then enables the clock to scanout cells.. The TAP controller 
then enters a Run- test/ Idle state and waits until a debug 
trigger enabled by (functional) code. 



15 control line signals scanout_load, scanout_shif t , ck, and 

scanout_ctl. Both scanout_load and scanout_shif t signals are 
generated locally by decoding the corresponding instructions 



in the DS controllers and dio ^ fcribut o d ^to the regional clock 
drivers (RCD) . Both signals have relaxed timing constraints 

2 0 since one global signal, scanout_cntl , is used. The one 
global scanout_cntl signal is generated in the ITC and is 
routed as a critical signal (staged in the ITC and the DS 
controllers) to all the DS controllers. The scanout_cntl 
signal gates qclk (queue clock) in lbfs (last buffer first 

25 serve) that contain scanout cells to generate scanout_clk to 
the scanout cells. The scanout_cntl signal is a function of 
the scanout instructions, the TAP FSM state, input from the 
debug unit (dbgsnapshot when SCANOUTLOAD is used) , rising 



Figure 6 illustrates logic equations and details of 
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transition on TDI (when SCANOUTLOADDBG is used) and TCK (test 
clock) (for SCANOUTSHIFT) . 

When SCANOUTLOADDBG is loaded, the trigger input is 
provided on a debug pin (DP) . The debug unit (DBG) is 
5 programmable to respond to the trigger input after a variable 
latency. The DBG triggers a signal (dbgsnapshot ) that is an 
input to the ITC . The ITC in turn triggers the scanout_cntl 
signal, which causes a snapshot. This feature, programmable 
debug, is very powerful for debugging purposes. 

10 When SCANOUTLOAD is loaded, the trigger input is 

provided by a low- to-high transition on the TDI pin. This 
provides an additional model of snapshot that is a failsafe 
option and is independent of the DBG. Figure 7 illustrates 
scanout clock waveforms for shifting a scanout chain (one 

15 shift per clock) . Figure 8 illustrates scanout clock 

waveforms for a snapshot on an external trigger. i 



^ The distributed test control scheme of the invent i ron 

works towards reducing the number of global test control 
lines, relaxes routing constraints on the test control lines, 

2 0 and adds greater flexibility in the physical placement of the 
test controller and test control logic. This translates to 
lower silicon area and reduced design effort (cost, 
efficiency, quality, reliability, and timeliness) . Moreover, 
the distributed test control scheme of the invention is 

25 scalable and flexible; that is to say, the distributed test 

control scheme may include the ability to add support for new 
test features late in the design cycle and implement fixes 
with relatively small impact on schedule. 
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The above embodiments can also be stored on a device or 
medium and read by a machine to perform instructions. The 
device or medium may include a solid state memory device 
and/or a rotating magnetic or optical disk. The device or 
medium may be distributed when partitions of instructions 
have been separated into different machines, such as across 
an interconnection of computers. 

The exemplary embodiments described herein are provided 
merely to illustrate the principles of the invention and 
should not be construed as limiting the scope of the subject 
matter of the terms of the claimed invention. The 
specification and drawings are, accordingly, to be regarded 
in an illustrative rather than a restrictive sense. 
Moreover, the principles of the invention may be applied to 
achieve the advantages described herein and to achieve other 
advantages or to satisfy other objectives, as well. 
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